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Final  Report 


This  is  the  final  report  for  the  four  year  URI  to  Clarkson  University  and  a 
subcontract  to  Boston  University. 

During  the  duration  of  this  project,  we  have  developed  several  new 
algorithms  for  simulation  of  microstructures,  devices,  and  fabrication  methods 
for  IC  technology.  We  have  interacted  extensively  with  both  industrial  and 
government  labs  and  collaborated  with  personnel  of  both. 

We  have  developed  algorithms  for  imaging  of  microstructures,  investigated 
the  effects  of  off-axis  illumination  and  finite-thickness  effects  on  mask  imaging. 
Furthermore,  we  have  investigated  the  effects  of  post-exposure  baking  on 
chemically  amplified  resists,  optimized  stepper  parameters,  as  well  as  proximity 
effects  on  mask  making.  Last,  but  not  least,  we  developed  new  visualization 
algorithms  that  allow  multiple  viewing  of  the  microchip  processes. 

We  have  collaborated  extensively  with  Air  Force/Lincoln  Labs  personnel 
and  have  published  three  joint  papers  with  them.  We  have  also  transferred  our 
developments  to  the  microchip  facility  of  the  NSA,  Wright  Patterson  AFB,  and 
the  NRL.  Our  industrial  collaboration  with  AMD,  IBM,  HP,  DEC,  LSI,  Micon, 
and  TI  has  continued  and  expanded. 

The  material  in  this  report  is  organized  as  follows.  One,  a  list  of 
publications  supported  by  this  project.  Two,  a  report  on  our  new  visualization 
algorithm;  three,  several  selected  papers  supported  under  this  project. 

During  the  tenure  of  this  project  the  ideas  of  the  nonuniform  grid  FFT  have 
been  extended  and  the  algorithm  developed  for  its  implementation  integrated 
into  our  various  simulation  algorithms,  including  our  Helmholtz  solver. 
Furthermore,  the  exposure-kinetics  of  photoresist  exposure  with  partially 
coherent  light  has  been  addressed  and  implemented,  yielding  simulations  that 
are  very  close  to  the  observed  res.ults. 

We  summarize  this  introduction  with  the  comment  that  much  progress  has 
been  made  in  the  simulation  process  of  microchip  fabrication  with  the  support 
provided  from  this  grant. 
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n2dX  -  A  Multiple-Image  Color  Contour  Plotter 

"n2dX"  is  a  color-contour  plotting  program  designed  to 

graphically  display  the  results  when  several 

computational  runs  are  made  varying  one  parameter  (defocus, 

for  example)  over  a  range  of  values.  The  outout  files  for 

the  run  come  out  with  extensions  numbered  sequentially  starting  at  zero 

In  n2dX's  main  window,  merely  enter  the  "base  name"  (omitting 

the  numerical  extension) ,  and  the  number  of  plots  in  the  run. 

"n2dX"  will  then  load  them  all  simultaneously.  Its  general  operation 
is  then  the  same  as  "2dX",  except  that  zooming  and  slice  plotting 
affect  all  plots  simultaneously.  On  each  plot's  control  panel 
there  is  a  button  that,  when  clicked,  cycles  among  the  images, 
essentially  "animating"  visually  the  effect  of  varying  the  parameter 
from  one  end  of  its  range  to  the  other. 

There  is  also  a  button,  labeled  "MultiPlot",  that  will  display 
that  window's  images  side-by-side  in  a  large  window.  This  display 
can  be  terminated  either  by  typing  "q",  or  by  closing  the  window, 

A  single  plot,  one  not  part  of  a  series,  can  be  viewed  by  entering 
its  full  name  and  giving  "1"  as  the  number  of  images. 

Slices: 

Another  feature  of  "n2dX"  is  plotting  the  value  of  the  function 
along  a  line  segment,  either  horizontal  or  vertical.  With  the 
cursor,  mark  a  rectangle  having  as  one  of  its  sides  the  line 
along  which  you  want  function  values  plotted.  This  line  will 
then  be  either  the  "N"  (North,  or  top),  "E"  (East,  or  right), 

"S"  (South,  or  bottom),  or  "W"  (West,  or  right)  edge  of  this 
rectangle.  Make  sure  that  this  rectangle  is  highlighted 
(Click  on  "Choose  Region"  until  it  is) ,  click  on  the  panel  "Direction" 
button  until  the  correct  edge  is  selected.  Click  on  "Plot  Slice" 
and  the  function  plot  will  appear  next  to  the  color  contour  plot . 

While  the  cursor  is  in  the  slice-plot  window,  the  x,  y,  and 
function  value  of  that  point  will  be  displayed  on  the  window's 
control  panel,  so  values  don't  need  to  be  read  by  eye  off  of  the 
plot. 

Grids : 

One  the  main  window,  there  are  buttons  selecting  the  gridding 
style.  "No  Grid",  "Element  Grid",_  or  "Full  Mesh"  can  be  .selected. 

The  next  time  a  window  is  redrawn,  the  appropriate  grid  lines 
will  be  displayed.  It  might  be  necessary  to  click  in  a  window 
once  or  twice  to  force  it  to  be  redrawn. 


OPC  Corrections  Algorithms  with  Resist  Processing 

Eytan  Barouch  and  Uwe  Hollerbach 
Department  of  Manufacturing  Engineering 
Boston  University 
Boston,  Mass. 

Abstract 

We  have  developed  a  new  optical  proximity  correction  (OPC)  algo¬ 
rithm  based  in  part  on  aerial  image  and  in  part  on  resist  development. 

Resist  development  considerations  include  full  3D  simulations,  SEM  in- 
■  formation,  as  well  as  quick  simple  modeling.  A  new  regula-falsi-based 
optimization  scheme  with  multiple  constraints  has  been  introduced. 


1.  Introduction 

In  the  past  two  years  optical  proximity  corrections  have  been  examiner]  in  great  detail 
and  have  been  reported  in  the  literature  (SPIE  1995,  PMJ  1995).  It  was  clear  from  the 
beginning  that  careless  corrections,  including  line  shortenings  (LS),  line  narrowing  (LN), 
and  corner  rounding  (CR),  can  cause  features  to  bridge,  at  least  optically.  To  further 
complicate  matters,  a  mask  layout  that  appears  nearly  perfect  at  zero  defocus  can  have 
very  poor  performance  at  end-range  defocus,  and  vice-versa.  Furthermore,  depending  on 
resist  and  layout,  the  aerial  image  serves  at  least  as  a  very  good  indicator  of  how  developed 
features  will  appear.  The  indicator  is  not  uniform  and  in  some  cases  processing  (exposure, 
post-exposure  bake,  and  dissolution)  must  be  taken  into  account.  Due  to  the  vast  variety 
of  features  and  patterns,  it  is  impossible  to  predict  globally  on  a  feature-by-feature  basis 
the  precision  of  the  aerial  image  approximation  to  the  processed  resist.  The  typical  range 
of  influence  is  2  to  4  microns,  while  feature  sizes  are  below  0.5  microns,  or  roughly  one 
wavelength  in  size. 

In  a  previous  set  of  publications  (SPIE  95,  PMJ  95)  we  have  reported  a  new  algorithm 
based  entirely  on  a  threshold  aerial  image  model  that  in  most  cases  works  extremely  well 
and  that  is  based  on  a  constrained  Newton  optimization  scheme.  However  the  algorithm 
assumes  a  priori  an  uncorrected  mask.  Furthermore,  the  aerial  image  (FAIM)  on  which 
it  is  based  and  the  optical  design  rule  checker  (ODRC)  have  been  parallelized.  In  certain 
tightly-packed  lines  (inverted  transistor-like  design)  line  shortening  takes  place,  and  if  the 
line  shortening  is  allowed  to  fully  correct  the  aerial  image  bridges  with  the  transverse  line. 
Thus,  an  algorithm  based  on  aerial  image  alone  will  require  a  redesign  of  the  device’s 
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assembly,  which  is  an  extremely  undesirable  situation,  and  which  should  be  used  only  as 
an  absolute  leist  resort. 

^  In  situations  of  this  sort,  a  much  better  guide  to  the  OPC  is  undoubtedly  the  developed 
resist  pattern,  either  simulated  or  SEMs.  This  is  indeed  the  main  criterion.  We  have  there¬ 
fore  redeveloped  major  parts  of  the  algorithm  OPTIMASK  and  named  it  EOPTIMASK 
in  honor  of  Eric  Jackson,  who  played  a  significant  role  in  developing  this  code. 

EOPTIMASK  has  the  capability  of  incorporating  aerial  image  and  a  Gaussian  con¬ 
volution  (or  any  other  convolution)  of  the  aerial  image  to  represent  a  resist  image,  and 
the  ability  to  take  into  account  full  3D  simulation  as  well  as  developed  micrograph  sur¬ 
faces.  It  has  been  fully  parallelized  on  Cray  T3D  and  IBM  SP2  platforms.  In  all  cases  it  is 
completely  linear,  allowing  full  chip  pre-  and  post-processing,  using  domain  decomposition. 

Several  examples  are  given. 


2.  Description 

EOPTIMASK  accepts  a  FAIM  input  file  containing  fractured  and  flattened  patterns 
of  the  CAD  layout  along  with  the  optical  parameters  of  the  system.  Furthermore,  it 
optionally  takes  either  a  3D  developed  resist  surface  or  a  convolution  function  that  is 
convolved  with  the  aerial  image  to  approximately  represent  the  developed  surface. 

Once  the  surface  criterion  and  its  level  (threshold)  is  established  the  design  rule  checker 
is  engaged  to  determine  where  the  threshold  exceeds  CD  and  where  the  surface  gradient  is 
below  acceptable  contrast  level.  This  takes  place  at  both  0  defocus  as  well  as  at  end-range 
defocus.  The  length  and  direction  of  each  failure  is  recorded  in  three  different  types;  line 
narrowing,  line  shortening,  and  corner  rounding. 

-  ^ 

The  line-narrowing  segments  are  averaged  over  the  defocus  range.  This  is  particularly 
critical  in  isolated  lines  where  the  aerial  image  has  a  tendency  to  spill  out  of  the  feature 
while  the  end-range  defocus  contour  is  embedded  inside  the  feature.  When  that  happens 
the  mask  cannot  be  corrected  for  this  defocus  range.  As  discussed  in  the  introduction,  the 
fashion  in  which  line  shortening  takes  place  is  strongly  dependent  on  the  physical  inter¬ 
action  amongst  the  features.  The  diffraction  patterns  obtained  from  isolated  and  packed' 
lines  are  very  different.  Therefore  a  complete  global  uniform  line  shortening  correction 
algorithm  is  beyond  our  reach.  Once  the  lengthening  of  the  shortened  line  is  achieved, 
the  corner  corrections  are  done  via  serifs  whose  lengths  are  determined  by  the  failure  and 
whose  widths  are  dependent  initially  on  the  failure  at  the  midpoint.  Once  the  initial  line 
movements  have  been  determined,  each  movement  is  bracketed  on  both  sides  to  determine 
optimal  design  under  the  restriction  that  two  features  cannot  be  closer  to  each  other  than 
a  given  value.  A  second  set  of  restrictions  prohibits  correction  on  segments  smaller  than 
a  multiple  of  the  minimum  feature  size  and  restricts  it  to  a  fixed  spot  size,  assuring  that 
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the  affected  mask  is  manufacturable.  The  iterations  take  place,  in  principle,  on  all  correc¬ 
tions.  A  regula-falsi  type  algorithm  is  imposed  and  it  exits  when  the  mask  can’t  be  further 
corrected.  Finally,  the  algorithm  selects  the  best  of  the  last  three  masks  and  reports  it. 

3.  Physical  Considerations 

The  process  of  resist  development  involves  three  basic  physical  steps: 

a)  Exposure:  This  step  is  largely  dominated  by  a  Maxwell-Material  equation 
coupled  to  the  kinetics  of  the  photoactive  compound  changes  during  exposure. 

b)  Post- Exposure  Baking  (PEB):  This  is  largely  controlled  by  diffusion  or 
reaction-diffusion  systems. 

c)  Dissolution  (Resist  Development):  The  physics  of  this  process  can  be 
understood  as  a  reaction-diffusion  equation  of  the  dissolver  penetrating  the  resist 

•  •  film  and  removing  the  molecules  that  can  be  removed.  From  energy  considerations 
of  front  tracking,  a  Hamilton- Jacobi  principle  is  fully  capable  of  describing  the 
dissolution. 

d)  Plasma  etching:  This  process  can  be  described  as  a  system  of  hyperbolic 
conservation  laws. 

In  principle  these  basic  processes  transform  the  aerial  image  surface  into  a  developed 
(or  etched)  surface  pattern.  Naively,  one  might  consider  convolving  the  aerial  image  with 
an  integral  operator  to  obtain  the  final  resist  surfaces.  The  speed  of  convolution  and 
its  simplicity  make  this  operation  very  tempting,  and  in  many  cases  it  yields  reasonable- 
looking  results.  However,  the  inherent  problem  remains:  How  does  one  know  whether  the 
particular  mask  design  is  truly  represented  by  a  convolution  of  the  aerial  image  with  a 
filter-like  function?  Furthermore,  from  a  strict  mathematical  viewpoint  the  existence  of 
such  a  convolution  implies  linear  operation.  If  the  process  is  truly  linear,  its  ingredients 
(a,  b,  c,  d  above)  must  be  linear  as  well.  One  might  argue  that  b.  is  linear  or  nearly- linear, 
and  a.  is  approximately  linear,  but  neither  the  Hamilton- Jacobi  energy  balance  nor  the 
hyperbolic  conservation  law  can  be  approximated  by  a  linear  system.  In  particular,  the 
dissolution,  in  its  simplest  form,  can  be  represented  locally  by  the  eikonal  equation,  where 
the  square  of  the  surface  gradient  equals  the  square  of  the  dissolution  front  velocity.  The 
same  notions  can  easily  be  demonstrated  for  plasma  etching  systems.  The  representation 
of  a  highly-nonlinear  system  by  a  linear  operator  which  works  for  some  cases  is  a  very 
dangerous  one  since  it  can  lead  to  uncontrolled  errors  no  matter  how  expertly  chosen  are 
the  parameters  of  the  convolution.  However,  at  the  small  window  level  it  may  work,  and 
a  highly-windowed  algorithm  such  as  EOPTIMASK  could  serve  as  a  good  candidate  to 
optimize  an  entire  chip. 
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Clearly  the  best  way  to  bias  the  mask  is  to  have  a  large  number  of  micrographs 
where  correlations  between  mask  design  and  printed  features  can  be  obtained  by  multiple- 
regression  analysis.  However,  this  is  a  very  expensive  and  impractical  way  to  proceed,  and 
it  can  work  at  best  at  the  local  level.  The  same  is  true  for  3D  resist  simulation  since  all 
computations  in  3D  are  somewhat  cumbersome  and  time-consuming.  Therefore  a  local 
correlation  can  be  tested  for  each  window  where  the  resist  patterns  are  simulated  locally. 
We  believe  that  a  global  correlation  function  between  aerial  image  and  developed  resist  is 
unobtainable.  Furthermore,  attempting  to  find  a  linear  operator  that  tranforms  the  aerial, 
image  to  the  developed  surface  is  an  extremely  ill-posed  problem  for  which  there  is  no 
guaranteed  solution,  and  when  a  solution  exists,  it  is  almost  certainly  not  unique.  In  other 
words,  it  is  a  procedure  doomed  to  failure. 


4.  Examples 

We  concentrate  in  this  section  on  demonstrating  the  concept  of  our  discussion.  We 
have  chosen  several  test  masks  to  illustrate  the  proximity  effects  and  the  connection  of 
proximity  corrections  to  3D  resist  images.  In  Figure  lA  we  show  the  corrected  mask 
of  an  inverted-like  transistor  system.  The  corrected  mask  has  0.4  micron  lines,  a  space 
between  them  of  .4  microns,  and  .5  micron  distance  to  the  outer  edges.  The  proximity 
correction,  once  chosen  as  nonaggressive,  corrects  for  all  the  line  shortening.  Figure  IB 
shows  both  the  uncorrected  and  the  corrected  images  superimposed  on  the  layers.  Figure 
1C  shows  the  corrected  image  alone,  superimposed  on  the  original  design.  As  can  be  seen, 
a  small  amplitude  oscillation  begins  to  develop  as  the  vertical  lines  attempt  to  approach  the 
hoizontal  bar.  This  takes  place  at  .6  microns  defocus,  rendering  impossible  the  complete 
correction  of  this  structure  via  aerial  image.  We  have  performed  the  full  3D  simulation 
of  this  design,  with  the  aerial  image  displayed  in  Figure  2k.  In  Figure  2B  we  show  the 
developed  resist  images  of  the  corrected  image  and  it  can  be  seen  very  clearly  that  a  fair 
bit  of  the  line  shortening  and  the  line  narrowing  displayed  in  Figure  2C  at  0  defocus  and 
Figure  2D  at  .6  microns  defocus  has  disappeared.  This  resist  image  was  obtained  for 
the  AZ-Spectralith  resist,  which  was  chosen  for  the  sake  of  the  current  simulation  for  its 
extreme  properties.  Specifically,  this  resist  happens  to  have  a  dissolution  rate  function 
with  a  sharp  cutoff  as  measured  by  N.  Eib  and  reported  earlier. 

In  the  next  example,  we  consider  a  system  of  six  parallel  lines  and  spaces  with  .7 
microns  pitch.  Using  the  Spectralith  resist,  we  see  the  corner  rounding  at  both  0  and 
end-range  defocus.  However,  the  line  shortening  and  .6  microns  defocus  as  well  as  the 
line  narrowing  are  more  severe,  as  expected.  In  Figures  3C  and  3D  we  show  the  actual 
line  shortening  for  a  single  isolated  line  at  .4  microns  feature  size  and  0  and  .6  microns 
defocus.  The  conclusions  that  we  draw  from  these  images  are  very  simple  and  very  con¬ 
sistent.  Isolated  lines  have  the  aerial  image  extended  outwards  at  0  defocus,  while  packed 
lines  perform  better.  The  situation  at  .6  microns  defocus  is  reversed.  The  CD  control 
is  relatively  good  for  isolated  lines  while  it  is  beginning  to  degrade  at  end-range  defocus. 
These  figures  illustrate  unequivocally  that  even  though  automated  proximity  effects  can 
improve  the  resist  images,  it  is  not  a  quick  fix  solution.  The  physics  of  the  process  must 
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be  better  understood  to  achieve  better  simulation.  The  images  discussed  so  far  come  from 
very  simple  designs  that  contain  most  of  the  inherent  difBculties  associated  with  a  large 
mask  OPC. 

Having  both  the  aerial  image  and  the  resist  surface,  our  attempt  to  find  a  precise 
linear  integral  operator  was  a  failure.  Also,  the  convolution  with  Gaussian  variances  of  .01 
resulted  in  significantly  degraded  resist  images  as  compared  with  the  actual  simulations. 

For  consistency  with  our  previous  OPTIMASK  code,  we  illustrate  the  results  of  EOP- 
TIMASK  on  both  the  “nikon  mask”  and  our  standard  SRAM  test  case.  In  both  cases 
the  result  of  EOPTIMASK  are  superior  to  the  previous  algorithm.  These  two  images  are 
displayed  on  Figures  4 A  and  4B. 

Last  but  not  least,  in  figure  5A  we  show  an  IBM  test  case  of  62  by  85  microns  with. 
.35  feature  size.  In  Figure  5B  the  corrected  mask  is  illustrated.  In  Figure  5C  we  show’  the 
superimposed  aerial  image  from  the  corrected  mask  where  lim-opt  played  a  major  role. 
The  failures  are  relatively  high  at  .6  microns  defocus  and  EOPTIMASK  reduced  the  total 
failure  area  by  about  95%  in  most  of  the  cases.  The  relaxation  of  the  lim-opt  shows  in 
Figure  5D  clear  bridging  of  the  critical  features.  This  bridging  again  cannot  be  corrected 
by  a  simple  linear  operation. 


5.  Conclusion 

Optical  Proximity  Correction  is  a  powerful  tool  to  bias  masks  wdthin  the  manufac¬ 
turing  range.  How'ever  it  is  not  always  possible  to  achieve  complete  correction.  The 
complexity  of  the  design  certainly  affects  the  globality  of  the  process  and  it  is  somewhat 
wishful  thinking  to  correct  a  whole  mask  in  one  pass.  In  many  cases  EOPTIMASK  works 
very  well.  In  some  cases  it  is  insufficient  and  in  some  cases  the  mask  needs  to  be  redesigned. 
The  user  and  the  designer  need  to  evaluate  the  results  in  order  for  successful  layout  to  be 
manufactured. 
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Abstract 

Recent  advances  in  computational  microelectronics  have  made  it 
possible  to  compute  the  images  of  very  large  masks.  Images  of  entire 
masks  require  many  gigabytes  of  storage.  It  is  therefore  desirable  to 
make  storage  requirements  as  small  as  possible.  In  this  paper  we  inves¬ 
tigate  the  effects  of  reducing  resolution  to  the  point  where  some  aliasing 
error  enters  the  final  answer.  Due  to  the  accuracy  requirements  of  the 
microelectronics  industry,  we  conclude  that  reducing  resolution  to  im¬ 
prove  storage  requirements  is  not  feasible. 

1.  Introduction 

Recently,  it  has  become  possible  to  compute  quite  exactly  the  images  of  very  large 
mask  patterns  used  in  the  microelectronics  industry.^’^  One  effect  of  this  new  capability 
is  that  the  size-limiting  factor  has  become  storage  rather  than  computation  time.  For 
example,  we  have  computed  the  image  of  one  layer  of  an  entire  SRAM  mask.  The  compu¬ 
tation  time  for  this  mask  was  about  24  hours  on  a  32-node  IBM  PVS  parallel  processor, 
and  the  storage  required  for  the  final  answer  was  approximately  5  Gb.  This  computation, 
and  others  like  it,  were  performed  at  the  minimum  resolution  required  to  guarantee  that 
there  would  be  no  aliasing  in  the  final  answer.  To  alleviate  the  storage  problem,  and 
also  to  reduce  somewhat  further  the  computation  time,  it  has  been  suggested  that  these 
computations  be  performed  at  a  lower  resolution.  Decreasing  the  resolution  by  a  factor  of 
2  would  save  a  factor  of  4  both  in  computation  time  and  in  storage  time,  at  the  cost  of 
accepting  some  aliasing  error  in  the  result.  In  this  paper,  we  investigate  the  precise  effect 
of  the  aliasing  error  on  the  answer,  with  attention  given  to  the  particular  requirements  of 
the  microelectronics  industry. 

2.  Theory 

It  is  well-known^’'^  that  lenses  act  as  spatial  filters  when  creating  an  image  of  an  object: 
components  of  the  electromagnetic  field  with  low  spatial  frequency  (slow  variation  in  X 
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and  Y)  are  passed  through,  while  components  with  high  spatial  frequency  (fast  variation 
in  X  and  Y)  are  removed.  The  coherent  transfer  function  of  a  simple  imaging  system  (the 
complex  amplitude  of  a  disturbance  at  the  image  point  {xi,yi)  due  to  a  unit  disturbance 
at  the  object  point  {xo,yo)  —  see  figure  1)  is  given  by 

1  rNA  i*27r 

K(Xo,yo,Xi,yi)  =  —  /  e^H^-^Xi+Xo/M)pcose-{yi+yJM)psin9) 

^  Jo  Jo 

Here  NA  is  the  so-called  numerical  aperture.  This  is  a  dimensionless  measure  of  the  radius 
of  the  imaging  system,  defined  as  r^f  \/r1  +  df.  M  is  the  magnification  factor  of  the 
imaging  system,  p  and  6  are  dimensionless  polar  coordinates  in  the  lens  plane.  $  is  an 
aberration  function,  which  contains  true  lens  aberration  terms  as  well  as  the  defocus  Zj . 

This  expression  for  the  coherent  transfer  function  is  derived  by  starting  from  the 
Kirchoff  theory  of  diffraction  and  making  the  thin-lens  approximation  and  the  paraxial 
approximation.  The  thin-lens  approximation  means  that  the  lens  system  introduces  only  a 
phase  shift,  but  no  lateral  shift,  into  the  electric  field  as  it  passes  through  the  lens  system. 
The  paraxial  approximation  means  that  only  rays  that  are  nearly  parallel  to  the  optical 
axis  are  considered,  implying  that  sin^  «  4>,  where  (f)  is  the  angle  between  the  ray  and 
the  optical  axis.  Both  of  these  assumptions  are  made  solely  to  simplify  the  presentation; 
without  them,  the  formulas  become  more  complex,  but  the  conclusions  remain  equally 
valid.  The  computations  presented  below  are  in  fact  made  using  the  more  exact  formulas 
as  implemented  in  the  FAIM  (Fast  Aerial  Image  Model)  program. 

By  scaling  the  object  coordinates  as  x'^  =  -XojM,  y'^  =  -yo/M,  the  coherent  transfer 
function  may  be  written  as 

-|  ^NA  p27r 

K{Xi-x'^,yi-y'^)  =  — J  J  e^H^-^=>=i-x'^)pcose-{y,-y'^)psin9) 

■  K 

The  electric  field  in  the  image  space  is  then  given  by 

/oo  poo 

/  F{x'^,  y'^)K{x'^,  y'^,  Xi,  yi)  dx'^  dy'^ 

-OO  J  —  OO 

/oo  poo 

/  ^(^o>  y'o)^{^i  -  Vi  -  y'o)  dx'o  dy'o^ 

-oo  J  —  oo 

where  F  is  the  function  which  defines  the  electric  field  as  a  function  of  position  in  the  object 
plane.  (In  general  this  would  be  a  function  of  position  in  the  three-dimensional  object 
space,  but  for  microlithographic  applications  it  is  sufficient  to  consider  planar  objects 
only.)  The  expression  for  the  electric  field  is  a  convolution,  which  may  be  evaluated  very 
simply  by  Fourier-transforming: 

EifxJy)=F{f.Jy)K{U,fy) 
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Because  the  lens  system  has  a  finite  radius,  the  Fourier  transform  of  the  coherent  transfer 
function  is  zero  beyond  a  certain  radius  in  spatial  frequency  space  (/x,/y):  specifically, 
the  cutofi"  is  at  NA  / A.  According  to  the  Nyquist  theorem,  it  is  therefore  necessary  to  use 
a  spatial  grid  with  a  spacing  of  A/2  NA  or  finer  in  order  to  properly  resolve  all  frequency 
components  that  are  present  in  the  electric  field  in  the  image  space.  In  order  to  compute  the 
intensity  in  the  image  space,  the  electric  field  is  squared.  This  squaring  doubles  the  highest 
spatial  frequency  that  is  present,  so  that  in  order  to  adequately  resolve  all  components  of 
the  image  intensity,  it  is  necessary  to  use  a  spatial  grid  with  a  spacing  of  A/4  NA  or  smaller. 
If  a  coarser  grid  is  used,  there  will  be  aliasing  effects. 


3.  Practice 

In  microlithographic  applications,  the  object  being  imaged  is  the  mask,  which  Is  de¬ 
scribed  by  a  piecewise-constant  discontinuous  function.  Errors  due  to  aliasing  may  be 
regarded  as  either  locally  changing  the  value  of  the  mask  function,  or  as  changing  the 
positions  of  the  discontinuities.  Of  these  two  types  of  error,  the  second  is  much  more 
important  for  microlithographic  applications.  The  discontinuities  mark  the  pattern  which 
is  to  be  printed,  and  changes  in  position  of  even  a  few  tens  of  nanometers  (a  few  percent 
of  the  nominal  linewidth)  can  mean  the  difference  between  a  working  and  a  non-working 
integrated  circuit. 

Figure  2  shows  a  small  sample  pattern,  about  14/rm  x  lO/zm,  that  is  representative 
of  mask  patterns  actually  used  in  the  microelectronics  industry.  The  lines  and  the  spaces 
between  them  are  0.35jj,m  in  size.  The  wavelength  used  to  image  this  sample  pattern  is 
0.365//7TT.,  and  the  NA  is  0.55.  These  values  are  typical  for  current  (1996)  computer  chip 
production.  Figure  3  shows  the  image  computed  at  a  resolution  of  A/16  NA.  This  image  is 
essentially  exact;  there  are  no  aliasing  errors  present.  The  intensity  contour  levels  shown 
are  0.1,  0.3,  and  0.5.  Note  that  the  curves  are  quite  smooth,  except  where  features  interact. 
This  smoothness  is  expected:  the  filtering  action  of  the  lens  system  tends  to  smooth  out  the 
pattern  being  imaged.  In  marked  contrast,  the  image  shown  in  figure  4  shows  significant 
ripples  in  the  contour  levels.  The  only  difference  between  the  two  images  in  figures  3  and 
4  is  that  the  latter  was  computed  at  a  resolution  of  A/2NA.  The  ripples  are  unphysical 
and  are  solely  due  to  the  aliasing  caused  by  the  inadequate  resolution.  Figure  5  shows 
the  difference  between  the  two  images,  overlaid  with  the  actual  mask  pattern.  The  shaded 
regions  indicate  where  the  magnitude  of  the  difference  exceeds  0.2.  From  this  figure,  it 
is  quite  clear  that  the  aliasing  errors  which  have  been  introduced  into  the  low-resolution 
image  significantly  change  the  effective  position  of  the  pattern  edges.  Finally,  in  figure  6 
the  maximum  error  due  to  aliasing  is  plotted  as  a  function  of  the  relative  resolution  for 
several  different  images.  A  relative  resolution  of  1  means  that  the  grid  size  is  A/4NA.  In 
all  cases,  the  maximum  error  at  a  resolution  of  A/2  NA  (a  relative  resolution  of  1/2)  is  quite 
large,  of  the  order  of  0.3  or  more.  Since  the  maximum  intensity  is  between  1  and  1.2  in  all 
cases,  an  error  of  this  magnitude  is  very  significant.  In  order  to  achieve  a  maximum  error 
that  is  smaller  than  0.01,  it  is  necessary  to  have  a  relative  resolution  of  0.9  or  higher.  For 
relative  resolutions  larger  than  or  equal  to  1,  the  behavior  of  the  error  depends  on  whether 


3 


the  image  was  computed  assuming  a  periodic  mask  pattern  or  an  aperiodic  pattern.  The 
error  of  ~  3  x  10-5  in  the  aperiodic  case  is  due  to  the  way  in  which  the  edges  of  the  overall 
domain  are  treated  in  the  computation.  Because  this  edge  effect  is  quite  small  compared 
with  the  aliasing  errors  which  are  present  at  relative  resolutions  less  than  1,  and  because 
it  occurs  well  away  from  the  mask  patterns  which  are  of  physical  interest,  it  may  safely 
be  ignored.  If  a  periodic  pattern  is  assumed,  there  is  no  aliasing  error.  There  is  a  very 
small  difference  of  about  10~®,  not  shown  in  the  figure,  but  this  is  caused  by  roundoff 
errors  in  writing  the  computed  intensity  field  to  a  file,  not  by  any  aliasing  error  during  the 
computations. 

4.  Conclusion 


To  summarize,  it  is  evident  that  aliasing  errors  are  present  in  image  intensity  'com¬ 
putations  made  at  resolutions  of  less  than  A/4NA.  Due  to  the  particular  nature  of  the 
object  being  imaged,  and  due  to  the  specific  needs  of  the  microelectronics  industry,  the 
magnitude  of  these  aliasing  errors  is  unacceptably  high  at  resolutions  near  A/2NA.  Since 
it  is  necessary  to  go  almost  to  a  resolution  of  A/4NA  in  order  to  achieve  an  error  below 
0.01,  it  must  be  concluded  that  it  is  not  possible  to  achieve  significant  tihae  or  storage 
savings  by  sacrificing  accuracy  in  resolution. 
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7.  Figure  Captions 

Fig.  1.  Diagram  of  a  simple  thin- lens  optical  system.  The  **thin-lens”  approximation  means 
that  refraction  effects  at  the  front  and  back  surfaces  of  the  lens  are  not  considered 
separately,  but  are  lumped  together  into  a  single  change  of  direction  at  the  central 
plane  of  the  lens. 

Fig.  2.  A  small  sample  mask  pattern.  The  hnes  to  be  printed  are  0.35/im  thick. 

Fig.  3.  Contour  lines  of  the  correctly  computed  image  of  the  mask  in  figure  2.  The 
intensity  levels  shown  are  0.1,  0.3,  and  0.5.  Notice  that  the  contour  lines  are  quite 
smooth,  except  where  features  interact. 

Fig.  4.  Contour  lines  of  the  under-resolved,  incorrectly  computed  image  of  the  meisk  in 
figure  2.  The  intensity  levels  shown  are  again  0.1,  0.3,  and  0.5.  In  contrast  to  figure 
3,  the  contour  lines  are  wavy  even  where  no  waviness  is  expected. 

Fig.  5.  The  difference  between  the  correct  and  incorrect  intensity  fields.  The  gray  blotches 
indicate  those  regions  where  the  difference  exceeds  0.2.  Notice  that  the  blotches  lie 
primarily  near  or  on  the  nominal  mask  edges. 

Fig.  6.  The  maximum  aliasing  error  as  a  function  of  the  relative  resolution  for  several 
different  images.  The  first  two  sets  of  points  are  for  the  periodic  and  non-periodic 
versions  of  the  mask  shown  in  figure  2,  while  the  last  two  are  for  different  mask 
patterns  which  are  not  shown  here. 
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Sample  Mask  Pattern 


Sample  Mask  Imaged  at  Low  Resolution 


Figure  4 
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Phase-shifting  masks  have  noticeable  width.  They  are  fabricated  with 
the  layer  of  quartz  on  the  top  (or  bottom)  of  a  thin  chrome  layer.  To  obtain 
phase  shift  tp  quartz  is  etched  back  on  the  depth 


d  = 


Xip 

27r(n  -  1) 


(1) 


where  A  is  w’ave-length  and  n  is  quartz  refrective  index.  It  is  natural  to 
expect  that  finite  mask  thickness  not  only  leads  to  extra  phase  but  also  will 
lead  to  a  cirtain  corrections  in  mask  transfer  functions. 


1  Basic  formulas 

1.1  Plane  waves  representation 

We  use  only  scalar  diffarction  approximation  in  Kirchoff  approximation.  We 
define  the  basic  wave-number  ko  —  27r/A.  The  wave-number  is  represented 
as  (k,  k^)  and  k  =  (p,  q)  is  a  two-dimensional  wave-number  -f  k^  =  ko).  If 
the  Fourier  transform  of  the  field  u{x,y,z)  (satisfying  Helmholtz  equation) 
is  known  at  plane  z'  then  the  Fourier  transform  of  the  same  field  in  the  half 
plane  z  >  z'  without  extra  sources  is 

u(k,  z)  =  exp  [—ikz{z  —  z')]  u(k,  z’)  (2) 

It  is  also  assumed  that  k^  =  (Ajq  —  k^)^/^  the  sign  corresponds  to  correct 
direction  of  propagation.  Complex  k^  corresponds  to  evanescent  waves. 
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If  numerical  aperture  of  the  optical  or  illuminating  system  is  small  then 
we  may  assume  that  |k|  <<  and  ^  ko  —  |kp/(2A:o).  This  approxima¬ 
tion  is  known  as  Fresnel  approximation  or  parabolic  approximation.  Indeed, 
returning  to  phisical  space  and  assuming  that  \z  —  z'\  =  d 


u{x,  z)  = 


.exp(—ikod) 
^  Ad 


J  u{x',z') 


exp 


•  kg  .  ,-2 

-t  — (X  —  X  ) 

2d^  ’ 


d?x' 


and  this  is  obviously  Fresnel  representation. 
In  the  Fraunhofer  limit 


u(x,  z)  =  i 


.exp{—ikoRo 


\d 


exp 


iki 


X  •  X 


(f 


where  Rq  —  {d?  +  That  is  equivalent  to 


(3) 


(4) 


(5) 


That  is  an  image  is  a  Fourier  transform  of  initial  distribution  taken  at  wave- 
numbers  corresponding  to  angles  of  difraction.  On  the  other  hand  if  Rq  ^  d 
this  formula  may  be  inverted 


-/I  ^  .exp{-ikod)  k 

“(k,  z)  =  z - - - ^(<^7-,  z  ) 


(6) 


1.2  Object  illumination 

We  assume  that  Kohler  or  critical  illumination  is  used.  In  this  case  it  follows 
that  mutual  coherence  may  be  represented  as 

J(x  -y)  =  J  d}kF{k)  exp‘*(^-^)  (7) 

where  F{k)  is  an  amplitude  of  a  coherent  plane  wave.  Moreover  F[k)  equals 
to  the  form  of  the  pupil  function  of  an  apperure  (with  the  numerical  aperture 
Nc)  used  in  the  condenser.  For  example,  if  a  condeser  aperture  is  circular. 

F{k)  =  e{Nl  -  \k\^lkl)  (8) 

Thus  to  obtain  an  image  from  this  illumination  we  need  to  find  an  image 
from  each  coherent  wave  and  then  find  the  incoherent  sum  of  the  waves  with 
corresponding  amplitudes. 
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1.3  Optical  system 

The  lens  represents  a  low-pass  filter  with  lens  transfer  function  K(^Xo,Xi), 
where  and  x,  coordinates  in  object  and  image  spaces.  If  and  u,  fields 
in  object  and  image  spaces,  respectively.  Then 

Ui{x,)  =  J  K{x^,x,)uo{xo)cPx  (9) 

The  general  form  of  transfer  function  is 

K{ko,  k)  %  +  Mh)e{Nl  -  k})  (10) 

where  k^  and  A:,  are  wave-numbers  in  object  and  image  spaces,  M  is  angular 
magnification  (or  image  space  reduction).  T{ko,  h)  is  the  angle  characteristic, 
of  the  lens  that  should  include  both  Gaussian  approximation  and  abberations 
Na  is  numerical  apperture  of  the  lens.  Final  image  field  is 

Fih)  =  I  F,{k,)K{K,h)6?k,  (11) 

All  the  imaging  properties  of  the  lens  depends  in  this  representation  on  the 
angle  characteristic  of  the  lens. 

It  is  possible  to  use  specific  T[k^,k,)  directly,  but  usually  2^  and  2^  are 
chosen  the  way  that  these  planes  are  imaged  in  each  other  within  Gaussian 
optics.  In  this  case 

K[ko,h)  ^  KnF"'^^'^'S(ko  +  Mk,)e{Nl  -  ^1)  (12) 

and  Zi  is  defocus. 

1.4  Object  transfer  function 

The  monochromatic  wave  with  wave-number  k  passes  trough  the  object  and 
scater.  The  amplitude  after  the  object  is 

Fo{ko)  =  j  F{k)0{k,K)d?k  (13) 

where  0{k,  k^)  object  transfer  function.  In  case  when  0{x,y)  =  6{x-y)0{x) 
(infinitely  thin  object)  0(^k,  ko)  =  0(^k  —  kg')  and  Oiji)  is  Fourier  transform 
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of  the  object.  Therefore  object  effectively  performs  convolution  in  Fourier 
space.  Only  small  Icq  are  imaged  in  the  lens. 

Rigorously  to  solve  the  problem  of  thick  mask  one  should  solve  Maxwell 
equations  at  the  mask  for  different  incident  waves  and  then  decompose  the 
solution  in  Fourier  series  which  are  imaged  by  lens.  Only  slowly  varing  func¬ 
tions  on  the  mask  are  imaged.  This  is  because  of  low  numerical  aperture 
NalM  at  object  side.  The  most  rough  estimate  of  the  influence  of  the  thick¬ 
ness  of  object  can  be  obtained  by  noticing  that  by  passing  through  object 
wave  get  the  phase 

Ott 

exp{i—{l  -  k'^f^^d{n  -  1))  -  exp{iip(l  -  (14) 

Using  the  fact  that  ki  =  k^/M  we  see  that  this  equivalent  to  introducing 
defocus  Zo  =  d(^n  — 

Simplest  method  of  implementation  is  to  replace  Fourier  transform  of 
object  by 

exp{i'p{l  -  k‘^y^'^)0{k)  (15) 

where  (/?  is  a  phase  shift  of  a  given  object. 

Therefore,  we  may  see  that  first  very  approximate  correction  to  the  mask 
thikness  is  an  introduction  of  defocus  level 


d{n  —  1) 
M2 


This  can  be  interpreted  as  if  imaging  of  the  mask  occurs  the  way  that  holes 
without  phase-shift  are  imaged  as  they  are.  The  hole  with  phase-shift  are 
imaged  the  way  as  if  they  are  defocused  by  the  amount  z^,.  The  reason  why 
these  holes  are  not  equivalent  is  that  side-walls  of  the  hole  have  nearly  zero 
amplitude  of  the  field  on  them  [3].  These  follows  mostly  because  the  angle 
of  incidence  is  very  small  and  the  hole  plays  the  role  of  perfectly  reflecting 
waveguide.  Then  in  this  case  we  can  use  KirchhoS'  representation  using  the 
interface  between  air  and  quartz  instead  of  apperture. 

Obviously  that  is  very  crude  estimate.  The  only  way  I  see  to  improve 
it  is  to  solve  the  whole  problem.  This  way  is  obviously  impractical.  May 
be  an  alternative  way  is  to  tabulate  various  defocuses  for  various  form  and 
materials  and  use  these  as  the  correction. 

First  it  should  be  verified  that  mask  thickness  may  be  taken  care  of  by 
introducing  defocus.  One  more  reason  why  this  can  be  so  is  that  only  very 
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smoothly  varying  transfer  functions  on  the  hole  can  be  imaged  by  small  nu¬ 
merical  apperture  (on  the  object  side)  optical  system.  In  this  case  we  may 
expand  corrections  in  Taylor  series.  There  should  not  be  linear  terms  because 
of  symmetry.  Quadratic  terms  of  the  expansion  are  essentially  equivalent  to 
introducing  small  defocus.  Indeed  quadratic  variations  in  phase  in  final  in¬ 
tegral  are  equivalent  to  quadratic  variations  in  space  for  abberation  function 
and  that  means  the  presence  of  defocus. 

From  the  experimental  point  of  view  the  presence  of  defocus  was  under¬ 
lined  in  [4],  [5].  Although  it  was  not  checked  there  that  the  hole  change  of  the 
image  can  be  described  by  introdusing  defocus  this  seems  plausible  judging 
from  the  data  of  [4].  Also  it  was  noticed  in  [3]  that  the  influence  of  thickness 
of  mask  decrease  dramaticall  with  increase  of  magnification  M.  Indeed  it 
should  decrease  quadratically. 

The  defocus  was  observed  in  [5]  to  depend  slightly  on  the  width  of  the 
■hole  and  on  whether  the  mask  is  binary  or  attenuated  phase-shifting.  So  far 
the  only  conclusion  that  can  be  drawn  is  that  accounting  for  the  width  of  the 
mask  by  introducing  additional  defocus  may  be  convinient  parametrization 
of  the  effect  of  the  mask.  This  parametrization  is  purely  impirical  and  require 
thorough  testing  with  rigorous  codes  such  as  EMFLEX  or  TEMPEST.  I  think 
that  we  need  either  wright  this  type  of  codes  ourselves  or  buy  them. 
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ABSTRACT 

An  algorithm  for  the  optimiaation  of  stepper  parameter*  has  been  designed  and  implemented.  The  cost 
function  used  In  this  optimisation  is  the  contrast.  The  aerial  Image  it  computed  using  the  computer  code 
PAIM.^’*  First,  the  contrast  of  the  image  is  calculated  and  the  derivatives  of  the  contrast  with  respect  to  the 
stepper  parameters  are  evaluated.  The  computational  cost  of  these  calculations  is  only  slightly  more  than  that 
of  one  aerial  image  simulation.  A  coiyugate  gradient  type  algorithm  is  then  used  to  obtain  the  minimum  of  the 
contrast. 


1.  INTRODUCTION 

Stepper  conditions  Lave  a  significant  effect  on  mask  printability.  It  is  tiicreforc  very  important  to  optimize 
the  stepper  conditions  to  yield  the  best  possible  stepper  performance.  Traditionally  this  U  done  through  trial 
and  error.  Here  we  report  the  development  and  implementation  of  an  algorithm  that  shortens  this  procedure  by 
optimising  the  illuminator  design  and  numerical  aperture  averaging  over  several  dcfocus  values. 

We  first  compute  the  aerial  Image  of  the  given  mask  using  our  Fast  Aerial  Image  Module  code,  FAIM.  Then 
we  calculate  the  contrast  associated  with  the  aerial  image  and  the  derivative  of  the  contrast  with  respect  to 
the  stepper  parameters  to  be  optimised.  Finally  the  contrast  is  miaimlied  by  taking  an  iterative  step  in  the 
appropriate  direction  determined  using  a  conjugate  gradient  type  algorithm. 

In  this  paper  we  will  first  describe  the  procedure  for  the  optimisation  of  the  stepper  parameters  (cost,  dcrlva^ 
tivea  of  cost  with  respect  to  the  optlmlxation  parameters,  constraints  and  convergence  criteria)  at  fixed  dcfocus. 
Then  we  will  discuss  the  optimisation  of  stepper  parameters  over  a  specified  defocus  range  for  the  purpose  of 
doing  optical  proximity  corrections  (OPC).  We  will  show  examples  of  how  the  optimisation  procedure  works  on 
some  current  mask  designs. 


FEB-10-1996  13:30 


P.  02/^10 


2,  OPTIMIZATION  PROCEDURE 


2.1  Cost  and  it’s  derivatives 


We  define  /p  to  be  the  ideal  intensity  pattern  that  we  wish  to  obtain  on  the  wafer  or  image  plane.  For  example, 
with  a  dark  field  mask  the  value  of  is  I  on  portions  of  the  wafer  plane  that  we  wish  to  expose  and  0  on  portions 
that  we  do  not  wish  to  expose.  We  define  I  to  be  the  intensity  pattern  obtained  on  the  wafer  plane  using  the 
FAIM  simulation.  Then  we  can  write  the  contrast,  which  is  the  cost  to  be  minimised,  as, 


(1) 


where  C  is  the  contrast  and  dA  refers  to  an  area  element  of  the  wafer.  The  objective  of  the  optimisation  is  to 
minimixe  C  so  that  I  is  as  close  to  as  possible,  /p  is  a  constant  during  the  optimisation  and  i  is  a  variable 
which  varies  when  the  stepper  parameters  are  changed.  Let  us  refer  to  the  stepper  parameters  in  general  as 
PijP2i  ‘-Mpn-  Then  the  variation  of  the  cost  with  respect  to  the  stepper  parameters  can  be  written  as 


where  dC/dpy  U  the  partial  derivative  of  C  with  respect  to  pr.  We  see  from  (2)  that  if  the  signs  of  Sp^s  are 
®PP^^te  to  that  of  BC / then  60  will  always  be  less  than  zero,  which  is  the  objective  of  the  optimisation. 


2.2  Calculation  of  derivatives 


The  intensity  at  the  Image  plane  can  be  written  as,^ 

where  If  w  the  normalisation  factor,  HT -^m)  is  the  amplitude  transmission  factor  from  image  to  mask, 
T{Xn)  is  the  transmissioB  function  of  the  mask  and  J{Xm,X'^)  is  the  mutual  intensity  distribution  of  the 
illuminator.  J^X^tX^)  is  computed  fay  integrating  the  amplitude  transmission  function  of  the  condenser  lens 
system  over  the  illuminator  aperture. 


J{X,n,XU)=  /  F{X„,XU,^.v)<K<iV  (4) 

•/ilium.  i 

where  P  Is  the  amplitude  transmission  function  of  the  condenser  lens  system,  ^  and  tj  are  the  coordinate  variables 
of  the  illuminator  aperture  area. 


In  fact  the  first  partial  derivatives  of  the  intensity  with  respect  to  the  illuminator  aperture  parameters  is  a 
by-product  of  the  intensity  calculation.  For  example,  take  the  case  of  a  circular  illuminator.  To  calculate  the 
Intensity,  we  have  to  integrate  over  the  area  of  the  circular  aperture.  To  calculate  the  derivative  with  respect  to 
the  partial  coherence  factor  or, 'we  have  to  integrate  just  over  the  circular  boundary.  This  information  is  available 
when  calculating  the  intensity.  Indeed,  K{Ximaf€sXm)  »  given  by,^ 


=//  ■ 


Xftij  Af,  A) 


(^) 
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where  i  is  a  unit  vector  that  points  in  the  direction  from  the  point  in  the  Gaussian  image  plane  on  the  optic  axis  to 
a  point  on  the  exit  pupil  plane,  NA  represents  the  rnnnerica]  aperture  on  the  right  hand  side  of  the  objective  lens, 
z/  is  the  defocus  value  and  ilf  U  the  magnification.  Again  for  the  same  reason  as  for  the  illuminator  parameters, 
we  see  that  the  first  partial  derivative  of  the  intensity  with  respect  to  the  numerical  aperture  is  a  by-product  of 
the  intezulty  calculation. 

If  a  “flood  expoaure”  normalisation  is  uacd  for  the  intensity^  then  is  also  a  function  of  the  stepper  parameters. 
In  the  case  of  this  normalization,  =  1  for  the  case  of  “flood  exposure”  where  the  mask  is  removed. 

Thus,  T(Xn)  =  I  for  all  jf„  in  (3).  The  equation  of  Ip  is  very  similar  to  that  of  1.  Therefore,  the  evaluation 
of  the  partial  derivatives  of  Ip  with  respect  to  the  stepper  parameters  is  similar  to  the  evaluation  of  the  partial 
derivatives  of  the  intezuity. 

The  derivative  of  the  first  partial  derivative  of  the  contrast  with  respect  to  the  stepper  parameters  can  be 
obtained  from  the  derivative  of  the  intensity  using  (2). 


2.3  Convergence 


We  use  a  conjugate  gradient  type  algorithm  to  direct  the  iterations.  The  Jacobian  of  the  cost  with  respect 
to  the  optimisation  parameters  is  calculated  by  fitting  the  coat  function  to  a  paraboloid  baaed  on  the  available 
information  of  coat  and  its  first  partial  derivatives.  Convergence  is  usually  determined  by  the  magnitude  of  the 
partial  derivatives.  If  they  are  small  enough,  the  minimum  is  nearby.  There  are  also  constraints  on  the  ranges 
of  the  optimisation  parameters  and  their  magnitudes  relative  to  each  other.  For  example,  when  optimising  an 
annular  illuminator,  the  individual  partial  coherence  factors  should  be  in  the  range  (0,  1),  and  <ri  <  (Tj.  Due  to 
these  constraints,  in  some  of  the  masks  at  the  minimum,  the  partial  derivative  of  the  contrast  with  respect  to 
some  of  the  optimisation  parameters  need  not  be  rero. 

In  most  cases,  the  optimisation  takes  about  five  to  six  iterations  to  converge.  For  many  masks,  there  is 
significant  improvement  in  line  width  control  and  in  some  cases,  there  is  improvement  in  line  shortening  and 
comer  rounding  errors,  as  will  be  demonstrated  in  the  examples  below. 


3.  DEFOCUS  RANGE 


Defocus  range  is  critical  in  the  manufacturing  environment  and  must  be  taken  into  account  at  oiTtimes.  In 
the  case  of  the  optimisation  of  stepper  parameters,  it  is  not  a  good  idea  to  minimise  the  stxm  of  the  contrasts  at 
scro  defccui  (best  focus)  and  end  range  defocus.  This  b  because,  the  minimum  solution  may  involve  the  situation 
in  which  the  aerial  image  b  perfect  and  meets  almost  ail  the  specifications  at  one  defocus,  whereas  at  the  other 
defocuB  it  faib  badly  in  meeting  the  specifications.  OPC  cannot  be  used  to  correct  thb  kind  of  a  mask.  It  is 
better  to  optimbe  the  contrasts  at  the  two  defocuses  separately  and  then  take  the  average  value  of  the  optimbed 
parameters.  Thb  tends  to  ensure  uniform  failures  at  both  defocusea,  which  means  that  OPC  can  do  a  good  job 
in  correcting  the  mask. 

There  b  a  significant  reduction  in  the  amount  of  OPC  corrections  needed  to  repair  an  optimised  mask  as 
compared  to  the  unoptimbed  case.  Thb  will  be  demonstrated  in  the  examples. 
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4.  EXAMPLES 


1.  “Nikon”  ttst  mask  with  circular  iUuminator 

In  this  example  we  present  the  optimisation  of  the  “Nikon*  maak  (Shiralshi  ct  al.).  The  mask  has  a  circular 
illuminator.  The  optimisation  parameters  are  the  partial  coherence  factor  of  the  IUuminator  and  the  numerical 
aperture.  In  figures  1(a)  and  1(b)  we  plot  the  portions  of  a  part  of  the  mask,  before  and  after  the  optimisation, 
which  have  failed  to  meet  design  specifications,  as  reported  by  our  optical  design  rule  checker  (ODRC),  CDLOSS.® 
CDLOSS  is  used  to  determine  where  the  mask  pattern  fails  to  print,  assuming  a  simple  threshold  model  for  the 
exposure,  post-exposure  bake,  and  dissolution.  It  is  clear  from  these  figures  that  massive  line  width  failures  have 
been  corrected  by  the  optimisation.  However,  there  is  no  improvement  in  the  line  shortening  or  comer  rounding 
errors.  These  failures  have  been  calculated  at  0.8/^  dcfocus.  In  figure  1(c)  we  plot  the  contrast  as  a  function  of  the 
iteration  number.  It  is  clear  that  the  contrast  decreases  as  the  optimisation  proceeds  and  convergence  is  attained 
after  about  six  iterations.  In  figure  1(d)  we  plot  the  optimisation  parameters  NA  and  cr  as  a  function  of  iteration 
number. 

2.  SRAM  gate  mask  with  double  annular  illuminator 

In  figure  2(a)  we  plot  the  CAD  layout  of  an  SRAM  gate  design.  The  double  annular  illuminator  consists  of 
two  sets  of  annular  apertures,  one  around  the  other.  Thus,  there  arc  four  partial  coherence  factors.  In  figures  2(b) 
and  2(c)  we  plot  a  portion  of  the  failures  of  the  SRAM  mask  at  0,8;i  dcfocus,  before  and  after  optimisation.  It  is 
seen  that  there  are  few  remaining  failures  of  the  optimked  mask.  In  this  case  there  is  a  significant  improvement 
in  line  width,  line  shortening  and  comer  rounding  errors.  In  figure  2(d)  we  give  a  table  of  the  partial  coherence 
factors  and  the  corresponding  contrasts  as  the  iteration  proceeds.  After  the  second  iteration  03  and  cr^  remain 
constant  because  of  the  constraint  C3  <  cr^. 

3.  Influence  of  the  optimization  of  stepper  parameters  on  OPC  corrections 

In  figure  3(a)  we  plot  the  CAD  layout  of  an  OPC  corrected  test  mask  with  an  ucoptimixed  circular  illuminator. 
In  figure  3(b}  wc  plot  the  CAD  layout  of  an  OPC  corrected  test  mask  with  an  optimized  circular  illuminator.  We 
obtained  the  optimised  partial  coherence  factor  by  taking  the  average  of  the  optimized  partial  coherence  factors 
at  O.C  and  0,4;^  defocus.  It  can  be  seen  that  very  few  OPC  corrections  have  to  be  made  for  the  optimized  case 
as  compared  to  the  unoptimised  case.  This  is  because  many  of  the  design  specifications  are  already  met  after 
optimizing  the  illuminator.  The  OPC  corrections  were  determined  using  our  program  OPTIMASK.^*^  In  figures 
3(c)  and  3(d)  wc  plot  the  aerial  images  of  the  unoptimixed  test  mask  and  OPC  corrected  optixnixed  lest  mask  at 
0,0  and  0.4pt  dcfocus  overiayed  onto  the  original  mask.  Improvements  in  line  shortening,  line  width  and  comer 
roundings  can  be  seen.  In  figures  3(e)  and  3(f)  we  plot  the  design  failures  of  the  OPC  corrected  optimized  test 
mask.  It  is  seen  that  thcac  arc  minimum.  These  failures  were  determined  by  CDLOSS. 

4.  SRAM  test  mask  with  annular  illuminator 

In  figure  4(a)  wc  plot  the  OPC  corrected  SRAM  test  mask  (provided  by  DEC)  with  an  unoptimized  annular 
illuminator.  In  figure  4(b)  we  plot  the  OPC  corrected  DEC  SRAM  mask  with  the  optimized  annular  illuminator. 
The  reduction  in  the  OPC  corrections  for  the  mask  with  the  optimized  illuminator  can  be  clearly  seen.  The  aerial 
images  of  these  masks  were  calculated  at  I-line  exposure. 

5.  Comparison  of  total  failure  lengths 

In  table  1,  wc  give  the  total  length  of  the  portions  of  the  mask  which  failed  to  meet  design  specifications, 
for  the  various  masks  described  above.  In  all  the  cases,  a  reduction  in  the  failure  length  for  the  mask  with  the 
optimised  Illuminators  is  apparent. 
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Ma^ 

A  (nm) 

Defocus  [ft] 

Ulum. 

Total  Failure  Length  (>t) 

Unoptimised  Ilium. 

Optimised  Ilium. 

“Nikon” 

365 

0.8 

Circle 

113.85 

69.82 

SRAM  gate 

365 

0.8 

Double  Annulus 

295.7 

129.7 

SRAM  gate 

365 

26.7 

14.38 

Test  mask 

365 

0.4 

Annxilus 

33.77 

9.69 

Tat  mask 

365 

0.0 

Aoaulus 

34.33 

16.828 

SRAM  test 

365 

0.4 

Annulus 

27.5 

23.8 

SRAM  test 

365 

0.0 

Annulus 

44.4  1 

18.03 

248 

0.4 

Annulus 

23.85  ^ 

14.40 

1  SRAM  test  (quarter 

iSS 

Annulus 

23.00 

14.32 

Tiblc  1:  Tot^  f^iurc  lengths  for  various  masks. 


Mask 

Wavelength  (nm) 

Illuminator 

Minimum  Contrast 

SRAM  test 

365 

0.4 

Circle 

4.1917 

SRAM  test 

365 

0.4 

Annulus 

SRAM  test 

365 

0.4 

Double  Annulus 

Tiblc  2:  Minimum  contrast  for  various  optimised  illuminators. 


6.  Comparison  of  vaxious  illuminators 

In  table  2,  we  give  the  contrast  in  the  aerial  image  for  the  DEC  SRAM  mask  at  0.4p»  defocus  for  optimised 
circular,  annular  and  double  annular  illuminators.  As  can  be  seen  from  the  table,  the  double  annular  illuminator 
does  the  best  with  the  lowest  contrast.  The  annular  illuminator  is  next  best  while  the  circular  iliummator  is  worst 
at  least  for  this  case. 


5.  SUMMARY 


In  this  paper,  we  have  described  an  algorithm  to  optimise  stepper  parameters.  Wc  have  shown  examples  of 
how  the  optimisation  procedure  works  for  some  current  mask  designs,  Wc  have  also  shown  the  influence  of  the 
optimised  stepper  parameters  on  OPC  corrections. 
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(C) 


Figure  1 


(d) 


(a)  Portions  of  a  part  of  the  unoptimizcd  "Nikon"  mask  that  have  failed  to  meet  design 
specifications  (a  =  0.45,  NA  =  0.4,  dcfocus  =  0.8p) 

(b)  Portions  of  a  part  of  the  optimized  "Nikon"  mask  that  have  failed  to  meet  design 
specifications  (o  =  0.6,  NA  =  0.54,  dcfocus  =  0.8(1) 

(c)  Contrast  (cost  function  to  be  minimized)  as  a  function  of  the  iteration  number,  as  the 
opfimization  proceeds 

(d)  Optimization  parameters  (o  and  NA)  as  a  funtion  of  the  iteration  number  as  the 
optimization  proceeds 


(a) 


^2 

^3 

a 

4 

Contrast 

1 

0.4 

0.5 

0.6 

0.7 

15.479 

2 

0.38 

0.55 

0.62 

0.69 

15.156 

3 

0.36 

0.60 

0.61 

0.63 

14.576 

4 

0.38 

0.57 

0.61 

0.63 

‘  14.824 

5 

039 

0.56 

0.61 

0.63 

14.811 

6 

0.34 

0.49 

0.61 

0.63 

14.898 

7 

0.41 

0.51 

0.61 

0.63 

14.789 

(d) 


Figure  2 

(a)  CAD  drawing  of  SRAM  mask 

(b)  Portions  of  a  part  of  the  unoptimized  SRAM  mask  that  have  failed  to  meet  design  specifications 
(C  =  0.4,  a  =  0.5,  o  =  0.6,  o  =  0.7,  NA  =  0.54,  defocus  =  O.Sp.) 

*  ^  J  ^ 

(c)  Portions  of  a  part  of  the  optimized  SRAM  mask  that  have  failed  to  meet  design  specifications 

(o^=  0.41,  0.51,  OjSs  0.61,  0.63,  NA  =  0.54,  defocus  =  O.Sp) 

(d)  Table  showing  how  the  contrast  of  the  SRAM  mask  changes  with  the  optimization  parameters  (o  ,  O  , 

o  ,o  )  as  the  optimization  proceeds  (NA  =  0.54,  defocus  =  O.Sjj.)  *  ^ 

3  4 
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Figure  3 


(a)  OPC  corrected  mask  with  unoptimized  circular  illuminator  (c  =  0.48,  NA  =  0.54) 

(b)  OPC  corrected  mask  with  optimized  curcular  illuminator  (or  =  0.62,  NA  =  0.54) 

(c)  0.3  intensity  level  contour  of  aerial  images  of  unoptimized  mask  and  OPC  corrected  optimized 
mask  ovcrlaycd  onto  original  mask  (dcfocus  =  0.4p) 

(d)  0.3  intensity  level  contour  of  aerial  images  of  unoptimized  mask  and  OPC  corrected  optimized 
mask  overlayod  onto  original  mask  (defocus  =  O.Op) 

(e)  Portions  of  orig;inal  mask  which  are  out  of  specs,  at  0.4p,  defocus  for  the  optimized 
OPC  corrected  mask 

(f)  Portions  of  original  mask  which  arc  out  of  specs,  at  O.Oii  defocus  for  the  optimized 
OPC  corrected  mask 
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Abstract 

In  this  paper  we  address  the  problem  of  dose  correction  in  the  data  bases  consistent  with 
ulwa-Iarge-sca^e  ^integration.  It  is  shown  that  recent  advances  in  transformation  theorv  pro- 
viae  a  natural  platform  on  which  to  build  these  dose  correctors.  Specincallv.  transformation 
approaches  making  use  of  compactly  supported,  smooth  basis  functions  are 'shown  to  be  par¬ 
ticularly  suitable.  This  is  a  natural  result  of  the  evolution  of  mathematically  based  correctors 
currently  in  use.  Previous  work  of  Parikh,  MacDonald  and  others  employed  global  transform 
metRod  to  detenrune  the  values  of  '^corrected”  dose. 

In  most  cases,  the  mathematical  inversion  is  essentially  ill  posed,  in  other  words,  the  etcact 
pattern  desired  cannot  be  obtained  using  a  finite  Gaussian  sum. 

In  this  paper  a  set  of  smooth  basis  elements  of  compact  support  are  employed.  The  ma*he- 
maticaUy  smooth  form  of  the  basis  makes  it  easy  to  match  dos«  at  boundaries  without  Gibbs 
phenomenon.  Thus  the  transform  field  can  be  partitioned  for  optimum  saeed.  Consequent'v 
wane  most  transformation  complexities  are  of  order  iV®  (the  inversion  of  an  x  N-  matmx) 
wnere  is  the  number  of  grid  points  characterizing  the  database,  we  developed  an  alo-orithm 

of  commlexity  .V‘iog.V.  A  method  of  dose  field  bias  is  emmioved  to  stem*  the  reauesf=  'or 
negative  dose. 

The  heart  oi  the  numerical  process  is  essentially  based  on  an  adapted  fast  non-uniform-sria 

curler  Transicrm  combined  with  proper  filtering  and  geometric  localization  methods.  Several 
examples  are  given. 
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1  Introduction  | 

E-beam  lithography  the  exposure  tool  emits  a  very  narrow  electron  beam  (as  low  as  5nm  spot- 
size)  at  each  prescribed  pixel.  It  deposits  its  energy  in  the  resist.  However,  due  to  forward  scattering, 
backward  scattering  as  well  as  tertiary  reactions,  the  very  narrow  energy  distribution  changes  its 
nature.  This  fact  maybe  characterized  by  expressing  the  deposited  “point  spread  (PS)  energy"  (the 
jnergy  deposited  in  the  resist  per  pixel  of  the  E-beam  writer),  as  a  narrow  forward  peak  /,  with 
added  corrections.  The  main  correction  that  must  be  taken  into  account  is  the  backward  scattering 
3.  For  simplicity  of  presentation  we  limit  ourselves  to  back  scattering  correction  only,  even  though 
in  principle  amy  addition  to  the  forward  peak  can  be  included. 

In  essence  we  are  adopting  the  classic  model  of  Kyser  and  Murata  [l]  to  represent  the  PS  function 
as  a  sum  of  two  Gaussians.  The  formulation  and  analysis  below  is  independent  of  this  fact,  but  it 
is  a  convenient  as  well  as  conventional  representation.  Following  Gerber  [2],  Parkovich  [3],  Parikh 
■;4i  and  Haslam  and  McDonald  [5]  we  define  the  “Proximity  Equation"  as  the  equation  to  find  the 
“Beam  Density”  D,  which  represents  the  machine  setting  to  yield  the  desired  deposited  exposure 
energy  in  the  resist  E. 

However,  as  correctly  stated  by  Gerber  [2j  and  others,  the  proximity  equation  is  ill-posed.* Ac¬ 
cordingly  a  precise  mathematical  solution  that  makes  physical  sense  is  not  possible.  Heuristically  it 
is  clear.  An  ideal  deposited  energy  for  mask  fabrication  is  composed  of  features,  represented  as  dis¬ 
continuous  step  function  distributions.  The  point  spread  functions  are  infinitei-/  differentiable.  Thus 
any  finite  linear  combination  would  yield  a  continuous  function,  rendering  the  problem  insolvable. 

The  ill  posedness  of  the  proximity  equation  is  vividly  displayed  when  the  Gaussian  model  is 
imposed  [2]  and  is  a  resultant  of  the  ill  posedness  of  the  backward  diffusion  problem  exhibited  in 
the  forward  peak. 

It  is  the  purpose  of  this  paper  to  introduce  a  somewhat  modified  proximity  equation  and  provide 
numerical  analysis  of  its  structure  as  well  as  some  analytical  insights  and  specific  example. 

Once  the  dose  is  agreed  upon,  we  demonstrate,  as  exam.ples,  the  prohles  for  simulated  problems 
using  E-beam  into  chemically  amplihed  resist.  Hollerbach  has  modified  our  sim.ulator  MCP  [61  so 
resist  images  can  be  obtained  for  E-beam  lithography,  and  we  used  the  modified  MCP  throughout 
this  paper.  It  should  be  mentioned  that  Moniwa,  Yamaguchi  and  Okazaki  [Y  have  introduced  a  3D 
simulator  SEED  3D. 

2  Formulation  and  Technical  Issues 

\\t  model  the  point  spread  function  P  as 

P  =  f^T]h.  (1) 

Let  b,  the-  back  scattering  energy  distribution,  be  represented  as  a  convolution  of  a  density  B 
■’^th  /,  i.e. 

b  =  f*B  (2) 

where  *  represents  a  convolution,  rj  the  relative  strength  of  the  back  scattering  and  P  is  obtained 
as  - 
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Let  D  represent  the  “energy  density”,  or  the  E-beam  writer  seUin^s  Tb,,,  fb  f  .  i  j  . 
energy  E  on  the  resist  surface  is  given  by  j  e  o  a  epositej 

E  =  P.D^f.D*(l^rjB), 

Let  M  represent  the  “mask  function”  with  ^ 

M  =  <  ^  chromed  regions 

[  1  at  transparent  regions.  (5) 

The  illposedneee  of  the  problem  assurer  us  that  the  proximity  equation  M  =  E  has  no  solution 
In  reahty,  even  rf  here  ,s  no  back  scattering,  one  CANNOT  achieve  better  ene-°v  l‘ h' 
projected  aenal  .mage)  then  the  forward-peak,  and  probably  somewhat  worse  We  thenefom  “ffl,  ? 
fte  mask  funct.on.  Specifically  we  assert  that  the  best  one  can  obtain  is  the  mask  funXconvoM 
ith  some  filter  somewhat  wider  than  the  forward  peak  /  ^ 

We  now  introduce  the  modified  proximiry  equation; 

M  *  h  =  f  ^  D (I =■  M  ^  f  *  d  /gj 

with  d  representing  the  excess  niter  width.  Taking  Fourier  transrorn,  of  (6)  yields 


:  'T  rib  I  ^  7}B 

The  division  bv  1  -b  nS  and  tbe  oe’"'-' — n-  f-b«  :r„.=we  .  '  •  •  • 

tatton  of  the  ill  posedness  of  the  proLm."  -ranstormatton  .s  anotner  mantfes- 

senl°  ;“eldt?r  ^  "  'I®  -  -  Seo-trit 


b  2^  -ri-B-  ±..,). 


Taking  the  inverse  transform  yields  the  density  D  as  the  mask  functio-' 
aennite  convolution  M  ^  B . 


ninus  the  small  but  positive 


^  rjM  ^  B  Tj" M  *  5  *  5  m  ... 

defi^tr'D^obtl-n"  ^  ^^nction  contdns  domains  in  which  ii  varnishes.  Since  M  .  B  is  positive 

cennite,  D  ob  a  ms  negative  values  which  are  of  course  nonnhvsical.  This  non-ohvsical  -esult  is  a 
consequence  of  the  assumotion  that  even  ‘-e  modme-’  •  7  P^.  sical  msuit  is 

The  inverse  t-n.-nrrn  T  /  “  -Tiocineu  proximity  equation  admits  a  ^'good”  solution. 

the  abov"^^^  suffers  from  tne  same  fate  exhibited  at  (9).  It  shoulc  be  noted  that 

ne  above  analysis  is  general  and  not  dependent  on  the  Gaussian  or  any  other  mode’  Tbe  ^ormal 
solution  of  the  modined  proximity  equation  is  given  as  ' 

=  Inverse.[iV/d(l -b 

with  the  understanding  that  D  may  become  negative 

The  cuesrion  is  how  can  the  sofution  D  be  employed’  Two  obvious  answers  are: 
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.  (1)  “Shift”.  That  means  add  a  constant  dose  to  D  in  each  pixel  such  that  it  i,  i 

Tbs  method  .s  potentially  more  successful  for  positive  resists  than  negative  chemically'al°piiS 

S?  Jh?"'  ’^n"'  ”rr  i'  “tw-s  a  negative  solution, 

tion  depositeri:  fheTit'  ^ 

/.m?  ^  m^hine  setting  it  is  natural  to  question  which  of  these  two  options  is  better 

Ip  the  rert7  "  — ■  Furthermore,  what  is  the  "besf  way  to  shift  Iwe  “  d 

It  is  clear  that  these  questions  must  be  answered  coupled  to  the  resist  processing  are  resis* 
dependent  and  vary  from  system  to  system.  In  this  paper  we  demonstrate  the  sItioTo  1: 
modified  proamity  equation  for  simple  features  as  weU  as  the  prototvpe  “NIKON  M  bI 

the  shifted  and  chopped  versions  of  the  resultant  energy  distrLtiomIe  clsidertd  fasi  buf^ot 
least,  -the  mixed  algonthm  is  under  active  consideration. 

3  Examples 


We  begm  our  discussion  by  specifying  the  explicit  form  of  the  PS  function  P,  Following  K'.= 
we  moce*  the  unnormalizea  point  spread  function  P  as  at  z,,  by  °  ' 

P  =  exp{-r^-/\3j)-r){3f/f3kVex-p{-r\^/3l) 


with 


ser  ;i] 


(11) 


r“  = 


-  X,)- -  {y  -  y,y 


( ‘  2^ 

snd  /3,  and  A  represent  the  half  width  of  the  forward  and  backward  scatterin.  nealcs.  and  r,’a) 
depth  of  fnU.-«t"  numoer  of  bacxscattered  to  forward  sca::ered  electrons  at  the 

It  should  be  emphasized  that  we  /q  ,  r 

the-  are  ^  iT  .  V  Constants  for  convenience  oniv.  Ir  cr.ae^a^ 

are  lunctions  oi  depth  a-r  rpn  l 

V  .  .  ■  -^  \  /  ca*.  oe  L*.eaveci  aS  such  witn  no  extra  cor^nlirsfinr^  i  ' 

aiscussion  is  available  in  reference  [9h  co...piication.  A  ae.ailec 

The  parameter  5^,  and  77  are  take^  fro^  ^e^e^p-rp  a-nrJ  ^  ^ 

the  notation  =  23^  -e.e.e.ce  anc  ana  ^ve  use  lor  the  nrst^xample 

j  j  * 

featmfs  sraphically  the  “shifted”  algorithm  for  a  1  -  d  mask  with  4 

•  igure  la  msplays  tne  point  snread  function,  fiir  lb  t’-^e  Gauss’a’””  fiitp’-  fio-*  i 

haf  hi  hf  i*  about  4%  ft  w  del 

memorv  dll  onO.“I  reported  in  ref.  [8).  ll  is  a  logic, 

niask  is  illustrated  in  ha  The  feature  sizes  are  all  .35/..  The 

'iae  y  =  5  as  well  as  discussing  the  mask! 


Tne  parameters  used  in  this  example  are  displayed  in  Table  1. 


SPIE  Vol.  1 924  992)  /  i  5 j 


Table  1 

Energy 

A 

Source 

■  *1  : 

lOKev 

.22 

.65 

.51- 

ref  9 

1 

25Kev 

.06 

2.6 

.51 

ref  9 

.7  '  .  ' 

30KeV 

.21 

3.84 

.70 

ref  7 

40KeV 

.04 

6.0 

.42 

ref  9 

When  the  aerial  image  of  the  NM  is  computed  the  minimum  resolution  is  64  x  96.  For  the  ' 
purpose  of  this  paper  we  have  chosen  256x256  grid  for  all  cases  involving  the  NM.  ' 

Figure  3  presents  level  curves  as  a  color  distribution  of  the  "chopped”  density  at  25keV,  while  ' 
fig.  4a  shows  the  same  data  file  in  a  relief  surface  form  and  fig.  4b  offers  a  sectioned  display  of  the 
same  surface.  ^ 

T 

.1 

Figures  5a-5d  compare  the  beam  intensity  for  lOkeV  and  25keV  with  both  the  "chopped”  and  ' 
"shifted”  versions.  As  discussed  earlier,  the  "intensity”  represents  the  actual  energ}'  deposited  m 
the  resist  during  the  process  of  E-BEAM  exposure. 

All  the  remaining  examples  are  concentrated  on  the  cut  line  y  =  o.  In  fig.  6a,  6b  we  offer  the 
extract  of  the  actual  solution  of  the  modified  proximity  equation  at  30keV  as  well  as  40keV.  The 
snarpness  of  the  forward  peak  at  40keV  is  believed  to  be  responsible  for  the  fast  oscillations  near 
the  nominal  mask  edge.  Furthermore,  fig.-7a-7d,  8a-8d  corresponds  to  the  actual  intensity'  for  both 
chopped  and  shifted  cases  for^all  energies  considered  in  Table  1.  It  is  clear  that  for  a  fast,  high 
contraSy  negative  resist,  the  cnopped  is  uniformly  superior.  Hov'ever.  the  situation  is  somewhat 
more  vague  when  slow  positive  resist  is  used  requiring  further  analysis.' 

We  now  turn  our  attention  to  the  resist  simulation  process.  The  ”chop-25keV”  energv  profile 
was^selected  for  the  actual  exposure  simulation  ,  using  SAL-601  resist  layer  .5^  thick.  The  acid 
production  of  the  negative,  chemically  amplified  resist  was  modelled  with  the  classic  DilLs  equation, 
employing  the  modified  MCP[6h  The  post  exposure  bake(PEB)  was  carried  out  with  MCP’s*system 
or  reaction  diffusion  equations  and  the  dissolution  was  simulated  using  .  .. 

MCP’s  Hamilton-Jacobi  algorithm. 

In  fig.  9a,9b  we  compare  the  results  of  the  simulation  for  the  line  y  =  o  for  40sec  and  60sec 
PEB  and  60sec  dissolution  time.  At  60sec  the  lines  have  not  opened  up  as  yet  but  are  expected  to 
open-up  for  longer  dissolution  time.  On  the  other  hand,  the  40sec  PEB  profiles  open  up  very  neatly, 
providing  near  optimal  conditions  for  0.35/z  feature  size  E-beam  lithography.  In  fig.  10  we  exhibit  - 
a  comparison  of  the^ ’’chopped”  and  ’’shifted”  lOkeV  and  25keV  final  profile  after  60sec  PEB.  Only  . 
the  cnopped  25keV”  is  a  viable  candidate  for  manufacturing  a  lx  Nikon  Mask  at  .35/i  feature  size.  ' 

N. 

4  Summary  and  Conclusions 

We  have  demonstrated  that  0.35/i  lx  mask  is  feasible  to  manufacture  with  SAL-601  resist.  Wit^  : 
optimization  of  resist  processing  coupled  to  an  optimization  scheme  for  density  selection,  w’e  expect 

to  develop  an  algorithm  providing  0.25/z  feature  size  masks.  '  '  '  .  V 
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